SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: WAHL, DR. , GEOFFREY M. 

O ' GORMAN DR., STEPHEN V. 

(ii) TITLE OF INVENTION: FLP -MEDIATED GENE MODIFICATION 

MAMMALIAN CELLS, AND COMPOSITIONS AND CELLS USEFUL 
THEREFOR 

(iii) NUMBER OF SEQUENCES: 4 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: PRETTY, SCHROEDER, BRUEGGEMANN & CLARK 

(B) STREET: 444 South Flower Street, Suite 2000 

(C) CITY: Los Angeles 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 90071 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS/MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/666,252 

(B) FILING DATE: 08-MAR-1991 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: REITER MR. , STEPHEN E. 

(B) REGISTRATION NUMBER: 31192 

(C) REFERENCE/DOCKET NUMBER : P31 8929 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (619) 535-9001 

(B) TELEFAX: (619) 535-8949 
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(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1380 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: NATIVE FLP 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .1269 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

ATG CCA CAA TTT GAT ATA TTA TGT AAA ACA CCA CCT AAG GTG CTT GTT 
Met Pro Gin Phe Asp lie Leu Cys Lys Thr Pro Pro Lys Val Leu Val 
15 10 15 

CGT CAG TTT GTG GAA AGG TTT GAA AGA CCT TCA GGT GAG AAA ATA GCA 
Arg Gin Phe Val Glu Arg Phe Glu Arg Pro Ser Gly Glu Lys lie Ala 
20 25 30 

TTA TGT GCT GCT GAA CTA ACC TAT TTA TGT TGG ATG ATT ACA CAT AAC 
Leu Cys Ala Ala Glu Leu Thr Tyr Leu Cys Trp Met lie Thr His Asn 
35 40 45 

GGA ACA GCA ATC AAG AGA GCC ACA TTC ATG AGC TAT AAT ACT ATC ATA 
Gly Thr Ala He Lys Arg Ala Thr Phe Met Ser Tyr Asn Thr He He 
50 55 60 

AGC AAT TCG CTG AGT TTC GAT ATT GTC AAT AAA TCA CTC CAG TTT AAA 
Ser Asn Ser Leu Ser Phe Asp He Val Asn Lys Ser Leu Gin Phe Lys 
65 70 75 80 

TAC AAG ACG CAA AAA GCA ACA ATT CTG GAA GCC TCA TTA AAG AAA TTG 
Tyr Lys Thr Gin Lys Ala Thr He Leu Glu Ala Ser Leu Lys Lys Leu 
85 90 95 

ATT CCT GCT TGG GAA TTT ACA ATT ATT CCT TAC TAT GGA CAA AAA CAT 
He Pro Ala Trp Glu Phe Thr He He Pro Tyr Tyr Gly Gin Lys His 
100 105 110 

CAA TCT GAT ATC ACT GAT ATT GTA AGT AGT TTG CAA TTA CAG TTC GAA 
Gin Ser Asp He Thr Asp He Val Ser Ser Leu Gin Leu Gin Phe Glu 
115 120 125 

TCA TCG GAA GAA GCA GAT AAG GGA AAT AGC CAC AGT AAA AAA ATG CTT 
Ser Ser Glu Glu Ala Asp Lys Gly Asn Ser His Ser Lys Lys Met Leu 
130 135 140 

AAA GCA CTT CTA AGT GAG GGT GAA AGC ATC TGG GAG ATC ACT GAG AAA 
Lys Ala Leu Leu Ser Glu Gly Glu Ser He Trp Glu He Thr Glu Lys 
145 150 155 160 

ATA CTA AAT TCG TTT GAG TAT ACT TCG AGA TTT ACA AAA ACA AAA ACT 
He Leu Asn Ser Phe Glu Tyr Thr Ser Arg Phe Thr Lys Thr Lys Thr 
165 170 175 



# 



TTA TAC CAA TTC CTC TTC CTA GCT ACT TTC ATC AAT TGT GGA AGA TTC 
Leu Tyr Gin Phe Leu Phe Leu Ala Thr Phe lie Asn Cys Gly Arg Phe 
180 185 190 

AGC GAT ATT AAG AAC GTT GAT CCG AAA TCA TTT AAA TTA GTC CAA AAT 
Ser Asp lie Lys Asn Val Asp Pro Lys Ser Phe Lys Leu Val Gin Asn 
195 200 205 

AAG TAT CTG GGA GTA ATA ATC CAG TGT TTA GTG ACA GAG ACA AAG ACA 
Lys Tyr Leu Gly Val lie lie Gin Cys Leu Val Thr Glu Thr Lys Thr 
210 215 220 

AGC GTT AGT AGG CAC ATA TAC TTC TTT AGC GCA AGG GGT AGG ATC GAT 
Ser Val Ser Arg His lie Tyr Phe Phe Ser Ala Arg Gly Arg lie Asp 
225 230 235 240 

CCA CTT GTA TAT TTG GAT GAA TTT TTG AGG AAT TCT GAA CCA GTC CTA 
Pro Leu Val Tyr Leu Asp Glu Phe Leu Arg Asn Ser Glu Pro Val Leu 
245 250 255 

AAA CGA GTA AAT AGG ACC GGC AAT TCT TCA AGC AAT AAA CAG GAA TAC 
Lys Arg Val Asn Arg Thr Gly Asn Ser Ser Ser Asn Lys Gin Glu Tyr 
260 265 270 

CAA TTA TTA AAA GAT AAC TTA GTC AGA TCG TAC AAT AAA GCT TTG AAG 
Gin Leu Leu Lys Asp Asn Leu Val Arg Ser Tyr Asn Lys Ala Leu Lys 
275 280 285 

AAA AAT GCG CCT TAT TCA ATC TTT GCT ATA AAA AAT GGC CCA AAA TCT 
Lys Asn Ala Pro Tyr Ser lie Phe Ala lie Lys Asn Gly Pro Lys Ser 
290 295 300 

CAC ATT GGA AGA CAT TTG ATG ACC TCA TTT CTT TCA ATG AAG GGC CTA 
His lie Gly Arg His Leu Met Thr Ser Phe Leu Ser Met Lys Gly Leu 
305 310 315 320 

ACG GAG TTG ACT AAT GTT GTG GGA AAT TGG AGC GAT AAG CGT GCT TCT 
Thr Glu Leu Thr Asn Val Val Gly Asn Trp Ser Asp Lys Arg Ala Ser 
325 330 335 

GCC GTG GCC AGG ACA ACG TAT ACT CAT CAG ATA ACA GCA ATA CCT GAT 
Ala Val Ala Arg Thr Thr Tyr Thr His Gin He Thr Ala He Pro Asp 
340 345 350 

CAC TAC TTC GCA CTA GTT TCT CGG TAC TAT GCA TAT GAT CCA ATA TCA 
His Tyr Phe Ala Leu Val Ser Arg Tyr Tyr Ala Tyr Asp Pro He Ser 
355 360 365 

AAG GAA ATG ATA GCA TTG AAG GAT GAG ACT AAT CCA ATT GAG GAG TGG 
Lys Glu Met He Ala Leu Lys Asp Glu Thr Asn Pro He Glu Glu Trp 
370 375 380 

CAG CAT ATA GAA CAG CTA AAG GGT AGT GCT GAA GGA AGC ATA CGA TAC 
Gin His He Glu Gin Leu Lys Gly Ser Ala Glu Gly Ser He Arg Tyr 
385 390 395 400 

CCC GCA TGG AAT GGG ATA ATA TCA CAG GAG GTA CTA GAC TAC CTT TCA 
Pro Ala Trp Asn Gly He He Ser Gin Glu Val Leu Asp Tyr Leu Ser 
405 410 415 

TCC TAC ATA AAT AGA CGC ATA TAAGTACGCA TTTAAGCATA AACACGCACT 
Ser Tyr He Asn Arg Arg He 
420 



• 
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ATGCCGTTCT TCTCATGTAT ATATATATAC AGGCAACACG CAGATATAGG TGCGACGTGA 1359 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 423 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Pro Gin Phe Asp lie Leu Cys Lys Thr Pro Pro Lys Val Leu Val 
15 10 15 

Arg Gin Phe Val Glu Arg Phe Glu Arg Pro Ser Gly Glu Lys lie Ala 
20 25 30 

Leu Cys Ala Ala Glu Leu Thr Tyr Leu Cys Trp Met lie Thr His Asn 
35 40 45 

Gly Thr Ala lie Lys Arg Ala Thr Phe Met Ser Tyr Asn Thr lie lie 
50 55 60 

Ser Asn Ser Leu Ser Phe Asp lie Val Asn Lys Ser Leu Gin Phe Lys 
65 70 75 80 

Tyr Lys Thr Gin Lys Ala Thr lie Leu Glu Ala Ser Leu Lys Lys Leu 
85 90 95 

He Pro Ala Trp Glu Phe Thr He He Pro Tyr Tyr Gly Gin Lys His 
100 105 110 

Gin Ser Asp He Thr Asp He Val Ser Ser Leu Gin Leu Gin Phe Glu 
115 120 125 

Ser Ser Glu Glu Ala Asp Lys Gly Asn Ser His Ser Lys Lys Met Leu 
130 135 ' 140 y 

Lys Ala Leu Leu Ser Glu Gly Glu Ser He Trp Glu He Thr Glu Lys 
145 150 155 160 

He Leu Asn Ser Phe Glu Tyr Thr Ser Arg Phe Thr Lys Thr Lys Thr 
165 170 175 

Leu Tyr Gin Phe Leu Phe Leu Ala Thr Phe He Asn Cys Gly Arg Phe 
180 185 190 

Ser Asp He Lys Asn Val Asp Pro Lys Ser Phe Lys Leu Val Gin Asn 
195 200 205 

Lys Tyr Leu Gly Val He He Gin Cys Leu Val Thr Glu Thr Lys Thr 
210 215 220 

Ser Val Ser Arg His He Tyr Phe Phe Ser Ala Arg Gly Arg He Asp 
225 230 235 240 

Pro Leu Val Tyr Leu Asp Glu Phe Leu Arg Asn Ser Glu Pro Val Leu 
245 250 255 

Lys Arg Val Asn Arg Thr Gly Asn Ser Ser Ser Asn Lys Gin Glu Tyr 



ACAGTGAGCT GTATGTGCGC A 



1380 



260 



265 



270 



• 



Gin Leu Leu Lys Asp Asn Leu Val Arg Ser Tyr Asn Lys Ala Leu Lys 
275 280 285 

Lys Asn Ala Pro Tyr Ser lie Phe Ala lie Lys Asn Gly Pro Lys Ser 
290 295 300 

His lie Gly Arg His Leu Met Thr Ser Phe Leu Ser Met Lys Gly Leu 
305 310 315 320 

Thr Glu Leu Thr Asn Val Val Gly Asn Trp Ser Asp Lys Arg Ala Ser 
325 330 335 

Ala Val Ala Arg Thr Thr Tyr Thr His Gin lie Thr Ala He Pro Asp 
340 345 350 

His Tyr Phe Ala Leu Val Ser Arg Tyr Tyr Ala Tyr Asp Pro He Ser 
355 360 365 

Lys Glu Met He Ala Leu Lys Asp Glu Thr Asn Pro He Glu Glu Trp 
370 375 380 

Gin His He Glu Gin Leu Lys Gly Ser Ala Glu Gly Ser He Arg Tyr 
385 390 395 400 

Pro Ala Trp Asn Gly He He Ser Gin Glu' Val Leu Asp Tyr Leu Ser 
405 410 415 

Ser Tyr He Asn Arg Arg He 
420 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: FLP recombination target site 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GAAGTTCCTA TTCTCTAGAA AGTATAGGAA CTTC 



34 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Synthetic oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
GATCCCGGGC TACCATGGAG AAGTTCCTAT TCCGAAGTTC CTATTCTCTA GAAAGTATAG 
GAACTTCA 



